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Abstract 

Iterative information processing, either based on heuristics or analytical frameworks, has been shown 
to be a very powerful tool for the design of efficient, yet feasible, wireless receiver architectures. Within 
this context, algorithms performing message-passing on a probabilistic graph, such as the sum-product 
(SP) and variational message passing (VMP) algorithms, have become increasingly popular. 

In this contribution, we apply a combined VMP-SP message-passing technique to the design of 
receivers for MIMO-ODFM systems. The message-passing equations of the combined scheme can 
be obtained from the equations of the stationary points of a constrained region-based free energy 
approximation. When applied to a MIMO-OFDM probabilistic model, we obtain a generic receiver 
architecture performing iterative channel weight and noise precision estimation, equalization and data 
decoding. We show that this generic scheme can be particularized to a variety of different receiver 
structures, ranging from high-performance iterative structures to low complexity receivers. This allows 
for a flexible design of the signal processing specially tailored for the requirements of each specific 
application. The numerical assessment of our solutions, based on Monte Carlo simulations, corroborates 
the high performance of the proposed algorithms and their superiority to heuristic approaches. 
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I. Introduction 

During the last two decades, wireless communication systems have undergone a rapid and 
steep evolution. While old analog systems mainly focused on providing voice communications, 
today's digital systems offer a plethora of different services such as multimedia communications, 
web browsing, audio and video streaming, etc. Along with the growing variety of services offered, 
the amount of users accessing them has also experienced a drastic increase. The combination 
of applications requiring large amounts of data traffic and high density of users, together with 
the scarceness of wireless spectrum resources, dictates high spectral efficiency to be an essential 
target in the design of modern wireless systems. 

From a physical layer point of view, the emergence of multiple-input multiple-output (MIMO) 
techniques [[II together with the development of near-capacity-achieving channel codes, such as 
turbo [2J or low-density parity check (LDPC) [[3l codes, have been the most remarkable steps 
towards this goal. The use of multiple antennas allows for increasing the theoretical capacity 
of a wireless channel linearly with the minimum of the number of antenna elements at the 
transmitter and at the receiver ends [4J. Depending on the specific MIMO technique employed, 
multiple antennas can be used to exploit the number of degrees of freedom of a wireless channel, 
its diversity or a mixture of both [[H. The combination with advanced channel codes enables 
transmission schemes with unprecedented high spectral efficiency. However, in order to realize 
in practice the performance predicted by theory, advanced receiver architectures combining high 
performance channel estimators, MIMO detectors and channel decoders are required. 

Joint maximum likelihood (ML) receivers are prohibitively complex for most modern commu- 
nication systems, especially systems with high MIMO order and concatenated codes. A wide- 
spread approach for the design of suboptimal, yet efficient receiver architectures is to separate 
the receiver into several individual blocks, each performing a specific task: channel weight 
estimation, noise estimation, interference cancellation, equalization or data decoding are some 
examples. Inspired by the iterative decoding scheme of turbo codes, some structures in which the 
different constituent blocks exchange information in an iterative manner have been proposed [[6l- 
[[TOl . In these receivers, each block is designed individually, and the way it exchanges information 
with the other blocks is based on heuristics. Consequently, while each block is designed to 
optimally perform its task, the full receiver structure does not necessarily optimize any global 
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performance criterion. Nevertheless, these structures have shown remarkably good performance 
at an affordable complexity, while keeping a large degree of flexibility in their design. 

Motivated by the success of heuristic iterative approaches, a set of formal frameworks for 
the design of algorithms performing iterative information processing have arisen in recent years. 
Among these, methods for variational Bayesian inference in probabilistic models have 
attracted much attention from the communication research community in recent times. These 
frameworks allow for the design of iterative algorithms based on the optimization of a global cost 
function. Typically, they are derived from the stationary points of a discrepancy measure between 
the probability distribution that needs to be estimated and a postulated auxiliary distribution, the 
latter distribution providing an estimate of the former. The different frameworks differ on the 
particular discrepancy measure selected and the restrictions applied to the postulated auxiliary 
function. We especially highlight two main approaches suggested so far in literature: belief 
propagation (BP) and mean-field (MF) method^. 

BP lfT6l is a Bayesian inference framework applied to graphical probabilistic models. In its 
message-passing form -referred to as the sum-product (SP) algorithm [17]- messages are sent 
from one node of the graphical model to neighboring nodes. The message computation rules for 
the SP algorithm are obtained from the stationary points of the Bethe free energy [[T4l . When 
the graphical model representing the system is free of cycles, the SP algorithm provides exact 
marginal distributions of the variables in the model. When the graph has cycles, however, the 
algorithm outputs only an approximation of the marginal distributions and it is, moreover, not 
guaranteed to converge [18J . In most cases, nonetheless, the obtained marginals are still a high 
quality approximation of the exact distributions. BP and the SP algorithm have found widespread 
application in the decoding of channel codes ifTTl . |fT9l , and have also been proposed for the 
design of iterative receiver structures in wireless communication systems [|20ll - [|24ll . However, 
modifications of the original algorithm are required for parameter estimation problems, such 
as channel estimation. This has been solved by, e.g., combining the SP algorithm with the 
expectation-maximization (EM) algorithm ETl . [|25ll or approximating SP messages which are 
computationally untractable with Gaussian messages [|26ll . [|27ll . 

'Some authors, e.g. Winn and Bishop 1121 . 1131 , consider BP outside the variational Bayesian framework, and usually use the 
term variational only in the context of MF-like approximations. We use, however, the more general view proposed e.g. in 1111 . 
1141 . 1151 , which considers BP as another algorithm for variational Bayesian inference. 
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MF approaches -proposed by Attias in [|28l and formulated as the variational Bayesian 
expectation-maximization (VBEM) principle by Real [|29|| - are based on the minimization of 
the KuUback-Leibler (KL) divergence [30] between a postulated auxiliary function and the 
distribution to be estimated. The minimization becomes especially computationally tractable 
under the MF approximation [|3T|| . in which the auxiliary function is assumed to completely 
factorize with respect to the different parameters. The obtained iterative algorithm guarantees 
convergence in terms of the KL divergence, but convergence to the globally optimum solution can 
only be guaranteed when the considered problem has a unique single optimum. However, it has 
proven very useful in the design of iterative receiver structures including channel estimation, e.g., 
channel estimation and detection for GSM systems [32], iterative multiuser channel estimation, 
detection and decoding B31 or channel estimation, interference cancellation and detection in 
OFDM systems [[34|[, [[35l. For other applications of MF methods, see [[36l-[[38l. Message- 
passing interpretations of this technique on probabilistic graphs have also been proposed in [12], 
[EH, ll40l and are commonly referred to as variational message-passing (VMP) techniques. 

In this contribution, we apply a hybrid message-passing framework to the design of iterative 
receivers in a MIMO-OFDM setup. This hybrid framework, recently proposed in [|4T][. [|42|. com- 
bines the SP and VMP algorithms in a unified message-passing technique. Message updates are 
obtained from the stationary points of a particular region-based free energy approximation [il4l 
of the probabilistic system. Specifically, the combined framework allows for performing VMP 
in parts of the graph and SP in others, thus enabling a flexible, yet global, design. 

From a MIMO-OFDM signal model, we derive a generic message-passing receiver performing 
channel estimation, MIMO detection and channel decoding in an iterative fashion. Channel 
estimation is not limited to the estimation of channel weights, but also includes estimation of 
the noise variance, which proves to be crucial for the operation of the receiver. The application of 
a unified framework to the whole receiver design unequivocally dictates the type of information 
that should be exchanged by the individual constituents of the receiver in the form of messages. 
This is in contrast to heuristic approaches which, for instance, arbitrarily select a-posteriori or 
extrinsic probabilities to be exchanged between the channel decoder and other modules based 
on intuitive argumentation or trends observed by simulation results [[9l, ifTOl . 

The generic messages derived can easily be particularized by applying different assumptions 
and restrictions to the signal model considered. Thus, our framework enables a highly scalable 
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and flexible design of the signal processing in the receiver. For instance, applying the messages to 
only part of the factor graph yields simplified architectures performing just a subset of the receiver 
tasks; also, small modifications to the factor- graph lead to different receiver structures with 
different performance and computational complexity tradeoffs. These properties are illustrated 
in our numerical evaluation, where the performance of a few selected instances of our proposed 
receiver is assessed via Monte Carlo simulations. The presented results demonstrate the high 
accuracy of our approach, and its superiority to iterative receivers based on heuristics. 

The remainder of the paper is organized as follows. The signal model of the MIMO-OFDM 
system considered is presented in Section [III followed by a brief review of the combined message- 
passing framework proposed in [|4T]|. [|42l| in Section|Illl In Section|lVl the generic messages to be 
exchanged in the factor-graph are derived, and the performance of five different receivers obtained 
from the generic derivation is tested in Section |Vl Finally, we draw some final conclusions in 
Section |Vll 

A. Notation 

Throughout the paper, lower-case boldface letters represent column vectors, while upper-case 
boldface letters denote matrices; (■)^ and (•)^ denote the transpose and conjugate-transpose 
of a vector or matrix respectively; || ■ || denotes the Euclidian norm; A B represents the 
Kronecker product of matrices A and B; 1^ denotes the identity matrix of dimension A^. 
Moreover, log denotes the natural logarithm; f{x) oc g{x) means that f(x) is equal to g(x) up 
to a proportionality constant; {f{x))g denotes the expectation of /(x) over g{x), i.e. {f{x))g = 
fx f{x)g{x)dx; S\s denotes all elements in the set S but s. 

II. Signal Model 

In this section a multi-user signal model for MIMO-OFDM is derived. The system is composed 
by M synchronous transmitter chains and receiver antennas, as depicted in Fig. [T] These 
transmitters can represent different transmission branches of the same physical transmitter, or 
physically separated transmitters at different locations. For the mth transmitter, a finite sequence 
of information bits u„i is encoded, yielding a sequence of coded bits c^. After interleaving, 
the interleaved coded bits are complex modulated, resulting in the vector Xm of complex- 

(v) 

modulated data symbols. Finally, the data symbols are multiplexed with the pilot symbols Xm , 
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giving the transmitted symbols = 1), . . . , x^{K, 1), . . . , L),..., x^{K, L)]^, 

where Xm{k, I) denotes the symbol sent by the mth transmitter on the kth subcarrier of the Ith 
OFDM symbol of a frame. The transmitted symbols are then OFDM modulated using an 
IFFT and the insertion of a cyclic prefix. 

The signal is transmitted through a wide-sense stationary uncorrected scattering (WSSUS) 
channel. The channel impulse response from transmitter m to receiver n during the transmission 
of the Zth OFDM symbol / can be described by 

gUl,T) = Y,c^)5{r-T^) (1) 

1=1 

where aim and Tnm are respectively the complex gain and delay of the z*"^' multipath component 
and Iran is the number of multipath components. We assume that the channel response is static 
over the duration of an OFDM symbol, but changes from one OFDM symbol to the next. Also, 
the maximum delay of each wireless link Tjlm™^ is assumed to be smaller than the duration of 
the OFDM cyclic prefij^, so that no inter-symbol interference (ISI) degrades the transmission. 
From ([T]), the sample of the channel frequency response at the /cth subcarrier of the /th OFDM 
symbol is found to be: 

1=1 

In this expression. A/ denotes the OFDM subcarrier spacing. 

At the receiver, the signal is OFDM demodulated by discarding the cyclic prefix and applying 
an FFT on the received samples. Under the previously stated assumptions that the channel is 
block fading and the maximum delays are smaller than the duration of the cyclic prefix, the 
signal received at the nth receive antenna on the /cth subcarrier of the /th OFDM symbol reads 

n = 1 

VnikJ) = '^hnm{kJ)Xm{k,l) +Wnik,l), k=l,...,K, (2) 

m=l 

/ = !,.. .,L, 

with Wn{k, /) denoting zero-mean additive complex white Gaussian noise (AWGN) with variance 
A^^. The equations in Q can be recast in a matrix- vector notation as 

M M 

y = ^ Xmhm + = ^ HmXm + W (3) 

m=l m=l 

^We assume without loss of generality that the delays are ordered in increasing order, i.e. r^'m^' > T^m- 
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Fig. 1. Block-diagram representation of the transmission model. 



where y = [yj, yj^^, with y^ = [yn{l, 1), • • • , yn{K, 1), . . . , ?/„(!, L),..., yn{K, L)f denot- 
ing the received signal at the nth receive antenna for a frame of K subcarriers and L OFDM sym- 
bols. Additionally, hm = [hj^, • • • , hjjj^, X„ = JAr®diag{a;„}, = [diag{h,i^}, . . . , diagih^mW 
and hnrn = [hnm{l,l),...,hnrniK,l),...,hn,rnil,L),...,hnniiK,L)]'^. Equation Q can be 
further compressed as 

y = Xh + w = Hx + w 
where x = [xj, a;^/]'^, h = [h], . . . , hl^]'^, X = [Xi, Xm] and H = [Hi, . . . , Hm]. 

III. Message Passing Techniques 

In this section, we briefly introduce message-passing techniques on factor graphs. First, we 
define the concept of factor graph on a probabilistic model, followed by the description of 
two standard message-passing schemes: the sum-product (SP) algorithm IfTTll and the variational 
message-passing (VMP) algorithm 1121. Finally, we show how to combine both algorithms to 
perform hybrid VMP and SP message passing in a factor graph [|4T]| . 



November 28, 2011 



DRAFT 



8 



A. Factor Graphs for Probabilistic Models 

Let p{z) be the probability density function (pdf) of a vector z of random variables Zi (i E I) 
which factorizes according to 

where Za = {zi\i E J\f{a)y with J\f(a) C X for all a G ^ and Z = Hae^ ^"(^a)'^^ is a 
normalization constant. We also define A/'(i) = {a E A\i E A/'(a)} for all i E I. Similarly, 
J\f{a) = {i E X\a E A/'(i)} for all a E A. The above factorization can be graphically represented 
by means of a factor graph [17]. A factor graphic is a bipartite graph having a variable node i 
(typically represented by a circle) for each variable zi, i E X and a factor node a (represented 
by a square) for each factor fa, a E A. An edge connects a variable node i to a factor node a 
if, and only if, the variable Zi is an argument of the factor function fa- The set A/'(i) contains 
all factor nodes connected to a variable node i E X and J\f(a) is the set of all variable nodes 
connected to a factor node a E A. 

Factor graphs provide a compact and intuitive representation of the statistical dependencies 
among the random variables in a probabilistic model. Furthermore, they enable the design of 
a class of iterative signal processing algorithms which are based on the nodes of the graph 
iteratively exchanging information (messages) with their neighbors (connected nodes). This class 
of algorithms has been coined message-passing techniques, and in the following we will describe 
the two instances of these techniques which have been most widely applied to signal processing 
for communication systems: the SP and VMP algorithms. 

B. The Sum-Product Algorithm 

The SP algorithm is a message-passing algorithm that computes the exact marginal distribu- 
tions Pi{zi) of the variables Zi associated to the joint distribution p(z) for tree-shaped factor 
graphs. When the factor graph does not have a tree structure, the outcome of the algorithm is 
only an approximation of the true marginal, and the approximate marginals bi{zi) ^ Pi{zi) are 
called beliefs. The message-passing algorithm is derived from the equations of the stationary 
points of the constrained Bethe free energy [fT4l|. 

^We will use Tanner factor graphs 1171 throughout this article 
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The algorithm operates iteratively by exchanging messages from variable nodes to factor nodes 
and vice-versa. The message computation rules for the SP algorithm read 

Tli^aiZi) = Y\_ TTT^c^ii^i), EZ,aE 
ceAf{i)\a 

where da {a E A) are positive constants ensuring that the beliefs are normalized to one. Often 
the constants da need not be calculated explicitly, and it is enough to normalize the beliefs after 
convergence of the algorithm (see L42J for more details on normalization issues). We use the 
notation ^(. )->.(.) for output messages from a variable node to a factor node and m(.)_j.(.) for input 
messages from a factor node to a variable node. This convention will be kept through the rest 
of the paper, also for other message-passing schemes. 

The variables' beliefs can be calculated at any point during the iterative algorithm as 

h{zi)= JJ ma-,i{zi) \I%eX. 

The SP algorithm acquired great popularity through its application to iterative decoding of, 
among others, turbo codes and LDPC codes, and has since then been used for the design of 
many iterative algorithms in a wide variety of fields [|2T|| . 

C. The Variational Message-Passing Algorithm 

The VMP algorithm is an alternative message-passing technique which is derived based on the 
minimization of the variational free energy subject to the mean-field approximation constraint 
on the beliefs. While it does not guarantee the computation of exact marginals (even for tree- 
shaped graphs), its convergence is guaranteed by ensuring that the variational free energy of the 
computed beliefs is non-increasing at each step of the algorithm [fT4ll . 

The operation of the VMP algorithm is analogous to the SP algorithm; the message compu- 
tation rules read 

rua^iizi) = exp{\ogfa{Za))u^^^^,^^^n,^^, WaEA,iE Af{a) (5) 
rii^aizi) = ei Y\_ mc^i{zi) El,a E Af{i) (6) 
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where Cj (i E I) are positive constants ensuring that rii^a are normaUzed. As in the SP algorihtm, 
the behefs can be obtained as 

c€jV{i) 

The VMP algorithm has recently attracted the attention of the wireless communication re- 
search community due to its suitability for conjugate-exponential probabilistic models lfT2l . The 
computation rule for input messages from factor to variable nodes allows for the obtention of 
closed-form expressions in many cases in which the SP algorithm typically requires some type 
of numerical approximation. 

It is shown in [42J that a message-passing interpretation of the EM algorithm can be obtained 
from the VMP algorithm. Assume that for a certain subset of variables Zi, i E £ C I we want to 
apply an EM update while still using VMP for the rest of variables. To do so, the beliefs bi are 
restricted to fulfill the constraint bi{zi) = S{zi — Zi) for dX\ i E £ additionally to the mean-field 
factorization and normalization constraints. Minimizing the variational free energy subject to 
these conditions leads to a message passing algorithm identical to the one described in (|5]) and 
© except that the messages rii^a for dX\ i E £ and a E A/'(i) are replaced by 

ni^a{zi) = 6{zi - Zi) with Zi = argmax^^ I J]^ ma-^i{zi) . (7) 

\aGA^(i) / 

D. Combined VMP-SP Algorithm 

As stated previously in this section, the VMP and the SP algorithms are two message-passing 
techniques suitable for different types of models. While SP is especially suitable in models 
with deterministic factor nodes, e.g. code or modulation constraints, VMP has the advantage 
of yielding closed-form computationally tractable expressions in conjugate-exponential models, 
as are found in channel weight estimation and noise variance estimation problems. Based on 
these facts, it seems natural to try to combine the two methods in a unified scheme capable of 
preserving the advantages of both. 

A combined message-passing scheme based on the SP and VMP algorithms was recently 
proposed in [|4T1l . [|42ll . This hybrid technique is based on splitting the factor graph into two 
different parts: a VMP part and a SP part. To do this, part of the factor nodes are assigned to 
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the VMP set (^vmp) and the rest are assigned to the SP set (^sp)- Given this classification, we 
can express the probabilistic model in @) as 

VMPpart SPpart 

p(^) = ^ n Z'^^^") n /^(^^) 

ae^vMP ceAsp 

where ^vmp U ^sp = and ^vmp H ^sp = 0. By applying the Bethe approximation to the 
SP part and the mean-field approximation on the VMP part, a new message-passing scheme 
is derived from the stationary points of the region-based free energy ll4T]| . [|42ll . The message 
computation rules for this algorithm read 

ma^tfi^i) = exp{\og fa{za))Yi^^^^^^^^n,^^, Va e Aymp, i e Af{a) (8) 

mf:,,{Zi) = rfa(/a(2a))n,,AAWV«.-.' ^« ^ '^SP,^ G A/'(a) (9) 

n,^a{z^) = e, H m^^fizi) J] rn^^,{z,) ^ieI,aeAf{i) (10) 

ceA^(i)n^vMP ceJ\f{i)nAsp\a 

where, again, da and Cj are positive constants ensuring normalized beliefs. The computation rules 
for messages outgoing factor nodes are preserved: for factor nodes in the VMP part (a G ^vmp) 
the messages are computed using ([8]) as in standard VMP; for factor nodes in the SP part 
(a G ^sp) the messages are computed via Q, which corresponds to a standard SP message. 
A message from a variable node i to a factor node a is computed as a VMP message when 
a G ^vMP and as a SP message when a G Asp, as can be deduced from (flOl) . 

As with the VMP and SP algorithms, the beliefs of the variables can be retrieved at any stage 
of the algorithm as 

aeJ^(i)nAYMP a&j\f{i)nAsp 
Note that we can apply the EM restriction to the belief of variables Zi which are only connected 
to VMP factors (i.e. A/'(i) fl ^sp = 0)- In that case, the message update rules remain the same 
except that the message ni^a in (fTOl) is replaced by © for the selected variables. 

IV. MIMO-OFDM Receiver Based on Combined VMP-SPA 

In this section, we present a generic iterative receiver for MIMO-OFDM systems based on 
the mixed VMP and SP message-passing strategy outlined in Section ITlI-DI Recalling the signal 
model presented in Section HIl we can now postulate the probabilistic model to which we will 
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apply the combined VMP-SP technique. In our case, we identify the observation to be the 
received signal vector y. As unknown parameters, we include the vector of information bits 



u 



\u 



1' 



, Uj^j] , the vector of coded bits c 



,T T IT 

'1 ! • • • ) '-A/J ' 



the vector of modulated symbols 
X = [xi, . . . , XmY, the vector of complex channel weights h = [hi, . . . , and the AWGN 
precision A. The system function of our model is the joint pdf of all parameters, which can be 
factorized as 

p{u, c, X, h, A, y) = p{y\h^ x, X) p{h)p{X) p{x,c, u)^ (1 1) 

fo fc /n /m 

where we have chosen to group the factors on the right-hand side into four functions. Factor 
/o(y, h, X, A) = p{y\h, X, A) denotes the likelihood of the channel weights h, the noise precision 
A and the transmitted symbols x given the observation y. Factor fc{h) = p{h) contains the 
assumed prior model of the channel weights, which is relevant for channel weight estimation. 
Function /n(A) = p{\), likewise, contains the assumed prior model for the noise precision 
parameter A which defines how estimation of the noise precision is done. Finally, function 
/m(cc, c, u) = p(x, c, u) denotes the modulation and code constraints. Note that further factor- 
ization of the factors in (fTT)) is possible and will, in fact, be used later in this section. 

A schematic factor- graph-like representation of the model in (fTTI) is depicted in Fig. [2l The 
observation factor node fo is connected to three ovals: channel weights, noise precision and 
modulation and coding. Each of the ovals represents a subgraph corresponding to factors fc, /n 
and /m in (fTTI) . The three subgraphs are connected to fo, which reads 

fo{y,x,h,X) oc A™exp{-A||j/-X/if } = A™exp {-A||j/ - //xf } . 
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Each of the subgraphs in Fig. [2] will be detailed in the remainder of this section. For now, we 
define the sets and as the set of factor nodes inside the channel weights, noise 

precision and modulation and coding subgraphs respectively. Likewise, we define the sets Zq, Xn 
and Xm as the set of variable nodes inside the channel weights, noise precision and modulation 
and coding subgraphs respectively. With these definitions, the set of all factor nodes in the graph 
is given b>0 

^= {/o}U^cU^nU^m, 
and the set of all variable nodes reads 

X = Xq U X]\j U Xm . 

From the observation factor node /o, sets of messages Mq, and Mm are sent to the 
respective subgraphs. These sets are composed of individual messages nif^^^z, z E X. The 
specific composition of the sets of messages depends on the exact configuration of variable and 
factor nodes of the corresponding subgraph, which will be described later in the section. After 
processing is completed at each subgraph, sets of messages Nq, A/'n and Nm, which correspond 
to the updated estimates of the channel weights, the noise precision and the transmitted symbols 
respectively, are send back to /q. 

In order to apply the combined VMP-SP algorithm, we need to define which factor nodes 
are assigned to the VMP set v4vmp and which are assigned to the SP set .4sp. We select the 
following splitting: 

^VMP ={/o} U U 

i.e. the observation factor node and all factors in the channel weight and noise precision subgraphs 
are assigned to the VMP set, and all factor nodes in the modulation and coding subgraph are 
assigned to the SP set. 

In the remainder of this section, we will present the details of each of the subgraphs, with 
several alternative factor-graph representations yielding different message-passing configurations. 

''with a slight abuse of notation, from this point on we use the names of functions and variables as indices of the sets A and 
I respectively. 
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Fig. 3. Subgraph corresponding to the noise precision prior model. 

The performance of the individual receiver structures obtained will be evaluated and compared 
in Section |Vl 



A. Noise Precision Subgraph 

The noise precision subgraph is the graphical representation of /n in (fTTI) . which we specify 
now as 

/n(A)^p(A) 

where p(A) denotes the prior distribution of A. With this, we can now specify the sets 

={/n} 

Xn ={A}. 

The factor graph representation of the subgraph is depicted in Fig. [3l It only consists of the 
variable node A and the factor node /n. Since there is only one variable node connected to fo, 
the set of messages Mj<t reduces to Mj^ = {nif^^x}- Analogously, = {nx^f^}. 

According to the message-computation rules given in Section |nil the message transmitted 
from fo to A is calculated as 



"^/o^a(A) = exp{(log/o(2/,a;,^, A))jVcArM} = A exp{-XA} 



(12) 



with 



lH ' h . 



A=\\y - Xh\\' + Tr\B CB + B H /f B ^ + Tr <^ XS^X 
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In the above expression, h = {h)^^, H = {H)n^, x = {x)^^^, X = {X)n^ are the means of 
h, H, X and X respectively taken with respect to the channel weights and modulation and coding 
output messages. Moreover, = {hh^)^^ — hh , and C = {H^H)]sr^ — H H. Finally, 
B = C/A^/^ where A is the diagonal matrix of eigenvalues and U is the matrix containing the 
eigenvectors of Sg. = (xx^) — xx^, i.e. Sa, = UAU^. 

The message in (fT2l) is proportional to the pdf of a complex central Wishart distribution of 
dimension 1, KLN + 1 degrees of freedom and associated covariance A^^ [|43l . We select the 
prior pdf p{\) to be conjugate, i.e., a complex Wishart. This yields the message 

Given the two incoming messages nif^^x and nif^^x, the outgoing message from A is also 
proportional to a complex Wishart pdf 

nA^/o(A) oc m/^^A(A)m/o^A(A) cx A™+°-^ exp{-A(A + Apnor)}- 

Since usually no prior information on the noise precision is available at the receiver, we select 
p{X) non-informative with parameters a = and Apnoi = 0. With this choice, the mean of A 
with respect to A/'n reads 

^ KLN 

A = Wn, = (13) 

Note that the above update for A coincides with the ML estimate of the noise precision. Since, as 
we will see later in the section, only the first moment of A is needed to compute other messages, 
it is sufficient to pass just this value to the rest of the graph. 

B. Channel Weights Subgraph 

The channel weights subgraph includes the graphical description of the factor /c in (fTTI) . We 
will present in the following two alternative subgraphs representing two possible definitions of 
fc'- in the first one, coined jom? channel weights subgraph, all channel weights for all transmit 
antennas are grouped together in a single variable node h\ in the second one, which we refer to 
as disjoint channel weights subgraph, the weights are split into M variable nodes hi, ... , Hm 
each of them containing the channel weights associated with an individual transmit antenna. 
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Joint Channel Weights 
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Fig. 4. Subgraph corresponding to the prior model of the joint channel weights. 

1 ) Joint Channel Weights Model: The joint channel weights subgraph is obtained from the 
following definition: 

fc{h)^p{h) 

with p{h) denoting the prior pdf of the vector of channel weights h. Using this model for /c 
leads to defining the factor and variable node sets as 

Ac ={/c} 
Xc ={h}. 

The factor graph describing the joint channel weight option is presented in Fig. ID As there is 
only one variable node connected to the factor node /o, the set of input messages to the channel 
weights subgraph is simply Mq = {^fo^h] and the set of output messages is the singleton 

The message from /o to h is given by 

mfo^h{h) = exp{{\ogfo{y,x,h, X))NMNt,} ocexp|-A (^\\y - Xhf + h^Dh^^ 

with D = {X^X)n^ — XX. Hence, mf^^hih) is proportional to a Gaussian pdf. We also 
impose the prior p{h) to be Gaussian, which yields the message 



"priory ^hprio/^ ^prior) 



For most practical channels it is reasonable to assume that h 



prior 



0. The receiver needs an 



estimate of the prior covariance of the channel S^, . In order to obtain the outgoing message 
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nh^f^ih), the two incoming messages are combined, leading to 

rih^f^ih) oc mf^^h{h)mf^^h{h) oc exp <-{h - h)^t:^^{h - h). 



} 



Thus, rih^f^ is proportional to a Gaussian pdf with covariance matrix 



and mean value 



h 



±H (AX y + ^hl^hpnor 



) 



2) Disjoint Channel Weights Model: The disjoint channel weights subgraph is obtained by 
factorizing /c with respect to each transmitter. More specifically, we define 

M 



with fc^{hm) — p{hm), m = 1, . . . , M denoting the prior pdf of the channel weights for the 
mth transmit antenna. We also specify the sets 



Fig. [5] shows the factor graph of the disjoint channel weights model with the above definitions. 
With this configuration, the channel weight vector h is split into M variable nodes hi, ... , Hm, 
each of them containing the weights associated with one transmit antenna. Each of these vari- 
able nodes is furthermore connected to a factor node /c^. Due to this separation, the set of 
incoming messages reads Mq = {m/Q_>^i^|m = 1, . . . , M}, while the set of outgoing messages 
is Nc = {nh^^f^\m = 1, . . . , M}. With this structure, the channel weight vectors are estimated 
sequentially by iterating through the transmit antenna index m. 
For the mth transmit antenna, the incoming message reads 



fc{h) = n fcjhj 



m=l 



Ac ={fcJm = l,...,M} 



Ic={hm\m = l,...,M}. 




where N^T^ = {?^fi,^,^/o}vm'=i,...,A/ denotes the set of all output channel weight messages 



except the mth one. Furthermore, = (/im') (m), X^ = {X^)n^ and Dm = (X"X„)jvm- 
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Fig. 5. Subgraph corresponding to the prior model of the disjoint channel weights. 

X^Xm- Again, rnj^^hm is observed to be proportional to a Gaussian pdf. Analogously to the 
joint channel weights case, we need to specify the prior of each individual channel vector h^- 
Defining them as Gaussians leads to the message 

^/Cm-^^m (^™) ~ P{^m) OC exp "l^ — — /l-m, prior) hm.pdoS^'^ ~ ^m.prior) 

where, once more, the receiver requires estimates of the prior parameters of the channel for 
each transmitter. The outgoing message from variable node is obtained by multiplying both 
incoming messages, leading to 

which equals, up to a proportionality constant, a Gaussian pdf with covariance matrix 



-1 

m , prior 



and mean value 



hm — ^h-rr, ( ( y - ^ Xrn'hm' 1 + ^_._^_7lm,prior 1 • 

It is important to note that every time a new message Uh^^fo is computed, the set of messages 
Mc needs to be recomputed again, as all nif^^h^i^m' ^ m depend on the updated messages 
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C. Modulation and Coding Subgraph 

The modulation and coding subgraph describes the factor /m in (fTT)) . We choose to factorize 
this factor according to 

/M(a;, c, u) = 



where f-p^{xm) = p{xm) denotes the prior pdf of the pilot symbols transmitted from the 
mth transmitter, fMmi^m, Cm,i, • • • , Cm,Cm) - p{xm\cm,i, Cm,c^) denotes the modulation 
constraints on the data symbols of the mth transmitter, fc^{cm,i, • • • , Cm,c™^, Um,i, • • • , Um,u„^) — 
p{cm,i, ■ ■ ■ , Cm,Cm |wm,i, • • • , Um,u„i) represents the code constraints for the mth codeword and 
fu^i{'^rn,i) — p{um,i) IS the prior pdf of the ith information bit transmitted by the mth antenna. 
In addition, the vectors Xm^ and Xm^ contain, respectively, the modulated pilot and data symbols 
transmitted from the mth antenna. Finally, Cm and Um denote the number of coded and informa- 
tion bits respectively transmitted in a codeword from the mth antenna. Using this factorization 
of fu, we define the sets and Xm as 

•Am ={fv^ |m = 1, . . . , M} U \m = 1, . . . , M} U {fcjm = 1, . . . , M} 

U{fu,Jm = l,...,M,t = l...Um} 
Xm ={x^^^\m = 1 . . . , M} U {a^^V = 1 • • • , M} U {c„,i|m = 1, . . . , M, i = 1 . . . C„} 

U {um,i\m = 1, . . . , M, i = 1 . . . Um}- 

The factor graph with the modulation and coding constraints is shown in Fig. [6l As it can be 
observed, the modulated symbols have been separated into different variable nodes according 
to the transmit antenna index m from which they are sent. The symbols corresponding to each 
transmit antenna port have been further subdivided into two different variable nodes Xm^ and Xm, 
the first containing the pilot symbols and the second containing the modulated data symbols. The 
modulated data symbols Xm^ are connected to the encoded bits Cm,i, ■ ■ ■ , Cm,Cm the modulation 
factor node which describes the mapping of bits onto a complex constellation. The coded 

bits are, in turn, related to the information bits Um,i, ■ ■ ■ ,Um,u,n through the specific channel 
code and interleaving scheme utilized, which is represented in a simplified manner by the factor 
fc^ in Fig. [6l Finally, every information bit Um,i has an associated prior probability represented 
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Fig. 6. Subgraph corresponding to tiie modulation and coding constraints. 



by the factor node furai- For the vast majority of applications, however, the values of the bits 
will be assumed to be equiprobable. With the proposed structure, the set of incoming messages 
is defined as Mm = \f^f b) |m = 1, . . . , M ^ U < m (d) |m = 1, . . . , M L while the set of 

1^ Jo ^-^m J JO J 

outgoing messages becomes Nu = \ n (p) \m = 1, . . . , M \ U \ n {d) \m = 1, . . . ,M\. 
In order to ease the derivation of the messages for this subgraph, we can re- write /o(2/, h, A) 



as 



M 



M 



Uy, X, h, A) oc A^^^ exp \ -A y^'^ - H^l^^^'^ 



m=l 



y 



(p) 



m=l 



(P)^(P) 

m m 
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where the contribution of pilot and data symbols has been split into two separate terms. We start 
by computing the message that factor node /o sends to Xm'- 



(a^m ) = exp I (log fo{y, X, h, A))j^^^^j^{m) } 



oc exp < —A 



m'^m / ) 

In the above expression, and similarly to previous definitions, a;^,^ = {x]^,)nm^ H^m' = (-^w )iVc' 

Additionally, A/'l?'' = {n (p) |i = 1,...,M} U in (d) |z = 1, . . . , M, i 7^ m} denotes 
the set of all outgoing detection messages except n (d) ^ . The message in (fT4)) is proportional 
to a Gaussian pdf with covariance matrix 



Zj (d) 

""my MP 

and mean 



J .Mm ^-^m 



The outgoing message n (d) , (xm^) is obtained by multiplying the messages m (d)(xm) 
and m (d). In this case, (d) is a SP message reading 

J Aim '^•^rn JJ^m ^^"m 

i=l \se5m / 

where Sm is the modulation set for user m and represents the extrinsic values of 

Xm\i) for each constellation point s G Sm, obtained from the SP demodulator and decoder. The 
combined message fed back to the observation factor node reads 

n (d)^r (xl^^) ocm, ^ (d)(cc^^)m, ^ (d)(xl^^) 



oc 

i=l \se5, 



where a^,^) (i) is the ith entry in the main diagonal of S (d) . It can be observed that the message 
factorizes with respect to the individual modulated symbols Xm\i), so the mean and variance 

November 28, 2011 DRAFT 



22 



of each data symbol can be computed independently and used to build the mean vector x and 
the covariance matrix Ti^ by inserting the updated mean and variances in their corresponding 
positions. 

It is important to note that, because the factor node f^m ^ SP factor node, the message 
?7,^(d)_^^^ is obtained by multiplying all messages received at variable node Xm^ except the 
message coming from /a^„, which in this particular setup reduces to 

n {d)^r (iCm^) =fnr ^ (d)(a;^^). 

All message-passing among the modulation factor nodes, coded bits and information bits is 
completed by using the standard SP algorithm, and will therefore not be described here. 

It remains to describe the income and outcome messages involving pilot symbols. As pilot 
symbols are known by the receiver, their prior distribution is p(xm''(0) = ^{p^^mii) — Pmii)) 
with Pm{i) denoting the ith pilot symbol sent from transmit antenna m. This imposes that the 
outgoing message n^p)^^^^ is also a Dirac delta, which can also be described as a degenerate 
Gaussian message with mean x'^^ = and covariance S (p) = 0. 

V. Simulation Results 

In this section, we propose a number of receiver structures based on the derivations made 
in Section |IV] and evaluate their performance by means of Monte-Carlo simulations. First, we 
present the parameters of the MIMO-OFDM system considered, followed by a description of the 
specific receiver structures that will be evaluated. Finally, the BER performance results obtained 
are presented and discussed. 

A. Description of the MIMO-OFDM System 

We begin by describing the MIMO-OFDM system used for obtaining the numerical results. 
Its main parameters are summarized in Table HI We consider an OFDM system with M = N = 2 
antennas at both transmitter and receiver ends. Two streams of random bits are independently 
encoded using a convolutional code with rate 1/3 and generating polynomials 133, 171 and 
165 (octal). After channel interleaving, the coded bits are mapped onto symbols of a QPSK 
or 16QAM constellation (with Gray mapping) which are then inserted into an OFDM frame 
consisting of L = 7 OFDM symbols with K = 75 subcarriers and with a subcarrier spacing of 
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TABLE I 

Parameters of the simulated OFDM system 



Parameter 


Value 


Tx antennas (M) 


2 


Rx antennas (A^) 


2 


Subcarriers (K) 


75 


OFDM symbols (L) 


7 


Subcarrier spacing (A/) 


15 kHz 


Channel coding 


1/3 Convolutional 


Symbol mapping 


16QAM 


Pilot symbols 


13 


Channel model 


3GPP ETU 



15kHz. Part of the time-frequency elements are reserved for the transmission of pilot symbols. 
We specify the following pilot patterns: pilot symbols are transmitted in the first and fifth OFDM 
symbol of the frame, with a frequency spacing of 12 subcarriers, resulting in a total of 13 pilot 
symbols per frame. Note that both transmit antennas share the same time-frequency elements 
for the simultaneous transmission of pilot symbols. Pilot symbols are randomly chosen from a 
QPSK constellation. 

Realizations of the channel time-frequency response are randomly generated using the ex- 
tended typical urban (ETU) model from the 3GPP LTE standard [|44| with 9 Rayleigh-fading 
taps. The channel responses corresponding to two different transmitters are uncorrelated and 
remain static over the duration of an OFDM frame. A new channel response is generated for 
each OFDM frame, with the responses of two different frames being also uncorrelated. 

B. Receiver Structures 

We introduce now the specific receiver architectures that will be evaluated in this section. 
All receivers are based on the generic message-passing receiver presented in Section |IVl The 
messages exchanged can be obtained by particularizing the generic messages according to the 
specific receiver configuration, as it will be detailed in the following. We evaluate three main 
types of VMP-SP receiver, which are described next. 

1) I-DJC-DD and I-DSC-DD Receivers: First, we introduce a full iterative receiver using 
exactly the messages derived in Section |IVl The receiver operates by iteratively updating the 
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beliefs of the channel weight vector, the data symbols and information bits and, finally, the noise 
precision parameter. 

Initialization of the beliefs of the channel weights and the transmitted symbols is required. 
The initialization of the channel weights is obtained from a pilot-based joint linear minimum 
mean-squared-error (LMMSE) channel estimator. For the initialization of the transmitted sym- 
bols, maximum-likelihood detection (MLD) is used, followed by soft-in soft-out (SISO) BCJR 
decoding. The belief of the transmitted data symbols is set to a Gaussian pdf with mean and 
covariance values obtained from soft modulation of the a-posteriori probabilities (APP) of the 
coded bits obtained from the SISO BCJR decoder. An initial estimate of the noise precision is 
obtained as in Section IIV-AI After the initialization, a full iteration of the receiver consists of 
updating the beliefs of the channel weight vectors (using either the joint channel weight model in 
Fig. m or the disjoint channel weight model in Fig. [5]), a message-passing run on the modulation 
and coding subgraph (updating the beliefs of transmitted symbols, coded bits and information 
bits) and, finally, an update of the noise precision parameter. Note that the message-passing 
operations done through the channel code factor node can be replaced by SISO BCJR decoding. 
In this case, the SP messages ric^ k^fM,n identified to be the extrinsic values of the coded 

bits output by the BCJR decoder. 

We refer to the described architectures as Iterative - Data-aided Joint Channel estimation - 
Data Decoding (I-DJC-DD) for the receiver using the joint channel weights model and Iterative - 
Data-aided Sequential Channel estimation - Data Decoding (I-DSC-DD) for the receiver obtained 
using the disjoint channel weights model. 

2) DJC-DD and DSC-DD Receivers: We introduce now a class of receivers which perform 
iterative data-aided channel weights and noise precision estimation together with equalization and 
demodulation of the transmitted symbols. Compared to the receivers presented before, channel 
decoding is left outside of the iterative process, and is performed only once at the end after 
convergence of the algorithm. The receiver capitalizes on just the knowledge of the complex 
modulation structure of the transmitted signal to refine its channel estimates, and not on the code 
structure. This receiver architecture is obtained by applying a special scheduling to the message 
computation and exchange between the subgraphs. Specifically, no messages are passed from 
variable nodes x^m to factor nodes until the last iteration of the algorithm. Instead, after 
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the messages m (d) are computed, the updated message n (d) is directly computed using 
(fT6l) . To this end, an initial value of the messages m (d) is needed. This can be obtained 
by setting 

PJ'')(As) = 7^ "im = 1, . . . , M,i = 1, . . . , Nd, s e Sm 

W \Sm\ 

in (fT5] ). In the expression above, \Sm\ denotes the cardinality of the set Sm- Note that this 
initialization corresponds to assuming that all modulated symbols in the constellation are equally 
likely, which is a valid assumption when the information bits are equiprobable and the channel 
code is regular. 

As for the previous receivers, an initialization of the beliefs of the channel weight vector, 
noise precision and transmitted symbols is required. The channel weight vectors are initialized 
as a Gaussian pdf, with mean obtained from a pilot-based LMSSE channel estimator and 
null covariance. Similarly, the beliefs of the transmitted symbols are also set to a Gaussian 
pdf with mean and covariance values obtained from a MIMO MLD (no BCJR decoding is 
done, as opposed to the I-DJC-DD and I-DSC-DD receivers). An initial estimate of the noise 
precision is then obtained following the procedure in Section IIV-AI After the initialization, the 
receiver operates by iteratively updating the beliefs of the channel weights (either jointly as 
in Fig. m or sequentially as in Fig. [5]), the transmitted symbols and noise precision parameter. 
After convergence of the algorithm (or maximum number of iterations attained), the messages 
n (d) , are computed, and a round of decoding based on the SP algorithm is performed, 
yielding the beliefs of the information bits. 

We refer to these receivers as Data-aided Joint Channel estimation - Data Decoding (DJC- 
DD) for the receiver using the joint channel weight prior model (Section IIV-BII) and Data-aided 
Sequential Channel estimation - Data Decoding (DSC-DD) for the receiver using the disjoint 
channel weight prior model (Section IIV-B2I) . 

3) PSC-DD Receiver: Finally we present a simple receiver consisting of a pilot-aided chan- 
nel estimator, a MEMO maximum likelihood detector (MLD) and data decoding. The channel 
estimation module is based on the VMP-SP generic receiver described in Section |lVl It updates 
iteratively the beliefs of the channel weight vectors corresponding to each transmit antenna and 
the noise precision. To this end, the channel estimator only exploits the pilot signals transmitted 
from each transmit antenna and does not capitalize on data symbols to refine its estimates. 
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In order to obtain this pilot-aided channel estimator from the generic receiver architecture in 
Section |Wl the messages n (d) , must be set to 



*fo 



This enforces that data symbols are not employed for channel weight estimation. In addition, 
the disjoint channel weights setup (see Fig. [5]) is selected. With this configuration, the output 
messages Nu are constant, reflecting the receiver's knowledge on the value of the pilot symbols. 
Hence, expectations taken over A/"m in the channel weights and noise precision subgraphs reduce 
to the value of the pilot symbols (or zero for data symbols), with all second-order terms vanishing. 
Note that, for this channel estimator, no update of the beliefs of the data symbols is performed. 
Equalization and decoding are done outside the VMP-SP framework. 

Additionally, a small modification in the processing corresponding to the noise precision 
subgraph is required. Note that, for the computation of the message mf^^x, the signal received 
at all -pilot and data- subcarriers is used, while only the signals received at pilot positions are 
utilized for channel weight estimation. This can be avoided by restricting this message to include 
only the observation at pilot positions, i.e. calculating m j^^^^ instead, where 

with N^^^ denoting the total number of pilots in a frame. 

The initialization for this estimator is simpler compared to that of the other receivers. It 
consists of setting the beliefs of the channel weight vector corresponding to each transmit antenna 
to a Gaussian prior with zero mean and zero covariance, while an initial value for the noise 
precision can be obtained from the signal received at pilot subcarriers. The receiver operates by 
sequentially updating the channel weight vectors corresponding to each transmitter hi, ... , Hm 
following the procedure described in Section IIV-B2I This is followed by an update of the noise 
precision parameter. The channel responses belonging to each transmit antenna obtained after 
convergence of the iterative estimator are fed to a MIMO maximum likelihood detector (MLD), 
followed by BCJR decoding. Thus, we can obtain BER performance results and benchmark them 
with analogous receiver structures using a different channel estimator. 

As we will see in the performance evaluation, this iterative estimator approximates the per- 
formance of a pilot-based joint LMMSE channel estimator with perfect knowledge of the noise 
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TABLE 11 

Summary of receiver structures 





Initialization 


Operation 


Receiver 


Channel Weights 


Transmitted Symbols 


Channel Weight Model 


Demodulation & Decoding 


PSC-DD 


Null mean and covariance 




Disjoint 




DJC-DD 


LMMSE estimator 


ML detector 


Joint 


Demodulation only 


DSC-DD 


LMMSE estimator 


ML detector 


Disjoint 


Demodulation only 


I-DJC-DD 


LMMSE estimator 


MLD + BCJR 


Joint 


Demodulation and decoding 


I-DSC-DD 


LMMSE estimator 


MLD + BCJR 


Disjoint 


Demodulation and decoding 



variance. The iterative estimator, however, presents the advantage of avoiding cumbersome matrix 
inversions depending on the specific values of the pilot-symbols. This estimator was presented 
(outside the context of message-passing techniques) in ||34| . A more detailed discussion of 
the computational advantages of this estimator over the LMMSE estimator is provided in this 
contribution. 

In the following, we refer to this receiver as Pilot-aided Sequential Channel estimation - Data 
Decoding (PSC-DD) receiver. 

The main characteristics of all the receivers presented above are summarized in Table HIl 

C. Numerical Results 

We evaluate separately the performance of the three architectures described in Section IV-B[ 
beginning with the simplest scheme, the PSC-DD receiver; we follow with the DJC-DD and 
DSC-DD receivers and conclude with the most advanced structures: the I-DJC-DD and I-DSC- 
DD receivers. 

In Fig. |71 the mean squared error (MSE) of the estimates of the channel weights obtained 
with the PSC-DD receiver is depicted. The MSE is plotted for three different Eh/No values 
as a function of the number of iterations performed. It is observed that the performance of 
the sequential pilot-based estimator approaches the performance of a joint LMMSE estimator 
with sufficient number of iterations. It is especially interesting to note the dependency of the 
number of iterations required for convergence on the E^/Nq value. For Ei,/Nq = — 2dB and 
Eb/No = 2dB, between 2 and 3 iterations are sufficient to achieve an MSE virtually equal to the 
LMMSE bound. When increasing Eb/No to 6dB, however, a minimum number of 5 iterations 
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Fig. 7. MSE of the estimates of the channel weights for the PSC-DD receiver versus the iteration index. 13 pilot symbols 
are inserted per OFDM frame. The dashed black lines represent the MSE obtained with pilot-based LMMSE joint channel 
estimation. 



— e — PSC-DD 1 iter 
— « — PSC-DD 2 iter 




Fig. 8. BER as a function of Et/No for the PSC-DD receiver with QPSK modulation. 13 pilot symbols are inserted per OFDM 
frame. The BER performance of a similar receiver using LMMSE channel estimation with knowledge of the noise variance is 
included as a reference. 
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Fig. 9. BER as a function of Et/No for tlie DJC-DD and DSC-DD receivers with QPSK modulation. 13 pilot symbols are 
inserted per OFDM frame. 

is needed, and about 10 iterations are required for E^/Nq = lOdB. Similar observations can 
be made when evaluating the BER of the receiver with QPSK modulation, as shown in Fig. [8l 
Again, the performance of the PSC-DD receiver equals that of the receiver with the LMMSE 
estimator when enough iterations for the receiver to converge have been run, and fewer iterations 
are needed the smaller Eb/No is. These results suggest that the iterative channel estimator in 
the PSC-DD receiver would be a good choice to obtain an initial channel estimate for the more 
complex iterative structures that we will discuss next. Furthermore, this channel estimator has 
the additional benefit of outputting soft estimates (the beliefs) of both the channel weights and 
the noise precision. Classical channel estimators, on the other hand, typically require separate 
noise estimation prior to the estimation of the channel weights, and only provide hard (point) 
estimates. 

BER results for the DJC-DD and DSC-DD receivers are portrayed in Fig. |9l The results have 
been obtained using a QPSK constellation for the modulation of data symbols. They indicate 
that a significant performance gain can be obtained by iteratively updating the channel weights, 
transmitted data symbols and noise precision parameter after the initialization, even though 
the receiver does not capitalize on the code structure within the iterative process. For both 
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Fig. 10. BER as a function of Eb/No for the I-DJC-DD and I-DSC-DD receivers with 16-QAM modulation. 13 pilot symbols 
are inserted per OFDM frame. 



receivers (with joint and sequential channel estimation), most of the improvement with respect 
to the initialization is obtained in the first three iterations, with only marginal gains obtained 
after further processing. Regarding the channel estimation approach, DJC-DD leads to a slightly 
better performance than DSC-DD in the whole simulated Eb/No range; the improved accuracy 
of the joint estimation approach comes at the expense of a larger computational complexity, 
as it operates with vectors and matrices of dimensions M times as large as in the sequential 
estimation approach, which can be a problem when calculating the necessary matrix inversions. 

Note that the receivers evaluated in Fig. [9] operate by capitalizing on the structure of the constel- 
lation used for the modulation of data symbols. Hence, their performance strongly depends on the 
type of modulation used. Low-order modulations, such as BPSK or QPSK, favor this receiver, 
as there is a relatively large distance between the points in the constellation, allowing better 
refining (through SP message-passing) of the VMP estimates of the transmitted symbols. When 
using higher order modulations, however, the receiver's performance suffers from the relatively 
small distance between adjacent constellation points. Specifically for the system investigated 
in this work, we found that the DJC-DD and DSC-DD receivers for 16-QAM or higher order 
modulations do not improve the performance with respect to the initialization. 
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The aforementioned problem with high-order modulations can be circumvented with the 
inclusion of the channel code structure in the iterative processing, as done in the I-DJC-DD 
and I-DSC-DD receivers. The BER performance of both receivers with 16-QAM modulated 
data symbols is depicted in Fig. \T0\ For benchmarking purposes, the BER performance of a 
heuristically designed iterative receiver with analogous features to the I-DJC-DD receiver is also 
plotted. We refer to this receiver as LMMSE receiver, as the channel estimation and MIMO 
detection modules are separately designed after the LMMSE principle. The LMMSE receiver 
is based on the design proposed in [9] for a multiuser CDMA receiver, and was adapted to 
MIMO-OFDM in BOl . where a detailed description of its operational principles can be found. 
In addition, the BER performance of a modified version of the I-DJC-DD receiver has also been 
included. This receiver, which we denote as I-DJC-DD(EM) receiver, results from applying the 
EM restriction to the beliefs of the channel weights h and the noise precision parameter A. Thus, 
this receiver is identical to the I-DJC-DD receiver except that the messages Uh^f^ and nx^j^ 
are computed according to ©. This modified messages imply that all terms depending on the 
second order moments of bh = ^/i^/o vanish. 

The results show that vast improvements in BER of the I-DJC-DD and I-DSC-DD receivers 
with respect to the initialization are obtained, even for very low Ei^/Nq values. As in the case of 
the DJC-DD and DSC-DD receivers, joint estimation of the channel weights performs marginally 
better than sequential estimation. Both message-passing receivers clearly outperform the heuristic 
LMMSE receiver, with E^/Nq gains close to IdB at a BER of 1%. We explain these gains by 
the fact that, contrary to the separate design of the different modules in the LMMSE receiver, 
our VMP-SP receivers are analytically derived based on a global objective function, namely the 
region-based free energy. This global design ensures that the information shared by the different 
receiver components is treated correctly, and resolves the choice of the appropriate type of 
information to be passed from the channel decoder to the other component parts of the receiver. 
It is also remarkable that the EM-constrained version of the I-DJC-DD receiver achieves roughly 
the same performance as the non-constrained version. This result seems to indicate that there 
is no significant gain to be achieved by computing soft channel estimates as compared to just 
point estimates, at least for the system considered. 

Another key feature of the I-DJC-DD and I-DSC-DD receivers is the estimation of the noise 
precision. This functionality does not only account for the AWGN, but also includes inaccuracies 
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Fig. 11. Average noise variance estimated by the I-DJC-DD receiver as a function of the iteration index. 13 pilot symbols are 
inserted per OFDM frame. The dashed black lines represent the true noise variance for each Eh/No value. 

in the estimates of the channel weights and data symbols. Fig. [TTI depicts the averaged noise 
variance estimate (inverse of the noise precision estimate A) provided by the I-DJC-DD receiver 
as a function of the iteration index for three different Eh/No levels. The true AWGN variances 
are also depicted as dashed black lines. It is apparent from the results that the behavior of the 
noise variance estimates (with respect to the true value) depends heavily on the regime in which 
the receiver is operating. For the very low Eh/No regime, the receiver significantly overestimates 
the noise variance; this is due to the low accuracy of the channel weights estimates and the large 
amount of errors in the estimates of the data symbols obtained. At the other extreme, for high 
E^/Nq values, the estimates of the channel weights and data symbols become so accurate (as it 
can be observed from the low BER values) that the noise variance estimate rapidly converges to 
the true AWGN variance, as the contribution of the estimates' inaccuracies becomes negligible. 
In the medium Ei,/Nq range, the noise variance estimate slowly converges to a value larger than 
the true variance, the difference between both values depending again on the accuracy of the 
other parameters' estimates. 

Conceptually, the estimate of the noise precision represents the amount of 'trust' that the 
algorithm has on the beliefs of the channel weights and data symbols. With high noise precision 
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values, the receiver has high confidence on these beliefs, leading to a rapid convergence towards 
a stable solution. On the contrary, low noise precision values will yield slower changes on the 
beliefs from one iteration to the next, resulting in a slower convergence rate. 

VI. Conclusion 

In this article we have used a hybrid VMP-SP message passing framework [|411 . [|42l for the 
design of iterative receivers for wireless communication. The framework has been applied to 
the factor graph of a generic MIMO-OFDM system. The messages obtained from the generic 
derivation have been used to obtain a set of receiver architectures ranging from computationally 
simple solutions to full-scale iterative architectures performing channel weight estimation, noise 
precision estimation, MIMO equalization and channel decoding. The performance of the pro- 
posed receivers has been assessed and compared to state-of-the-art solutions via Monte Carlo 
simulations. 

A fundamental contribution of this work is the application of a unified framework that jointly 
optimizes the receiver architecture based on a global cost function, namely the region-based 
variational free energy. The message-passing scheme used in this work can be obtained from 
the equations of the stationary points of a particular region-based free energy approximation. 
The resulting algorithm applies the VMP and SP algorithms to different parts of the graph and 
unequivocally defines how the messages of the two respective frameworks are to be combined. As 
a result, the hybrid technique allows for a convenient design of wireless receivers in which the SP 
algorithm is used for demodulation and channel decoding and the VMP algorithm is applied for 
channel weight estimation, noise covariance estimation and MIMO equalization. The connection 
between the specific receiver component parts is defined by the message-computation rules, in 
contrast to other approaches in which the selection of information to be exchanged among the 
specific receiver components is done based on numerical results and/or intuitive argumentation. 

We illustrate the application of the framework by applying it to the design of receivers in 
a MIMO-OFDM communications system. From the factor-graph representing the underlying 
probabilistic model, a set of generic messages exchanged between different parts of the model, 
represented by sub-graphs, is derived. We choose to split the factor graph in three main subgraphs 
corresponding to the channel weights prior model, the noise precision model and the modulation 
and coding constraints. The advantage of this modular approach is that it enables a scalable. 
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flexible design of the receiver in which the modification of a specific sub-graph does not modify 
the processing performed in other parts of the graph. Thus, a collection of different receiver 
architectures can be obtained by applying different initialization and scheduling strategies. 

In order to assess the performance of the receivers derived with the proposed framework, we 
define five specific instances of the generic message-passing receiver. The particular architectures 
selected span from to full-scale iterative schemes, in which the output of the channel decoder 
is used to refine the estimates of the channel parameters and the transmitted symbols, to low- 
complexity solutions, in which only pilot symbols are used for channel weight and noise variance 
estimation. This particular selection of receiver architectures serves as an illustration of how the 
tradeoff between computational complexity and receiver performance can be adjusted, with the 
generic message-passing receiver as a starting point. The numerical results, obtained via Monte 
Carlo simulations in a realistic MIMO-OFDM setup, confirm the effectiveness of the receivers 
derived following the hybrid VMP-SP framework. In particular, the convergence behavior of 
the receivers tested is especially remarkable. All receiver instances yield an improved or equal 
performance with increasing number of iterations, both in terms of BER and MSB of the channel 
weight estimates. We explain these favorable convergence properties by the use of the unique, 
global cost function from which the algorithm is derived. The estimation of the noise precision 
parameter, accounting for the uncertainty on the estimates of the channel weights and transmitted 
symbols in addition to the AWGN variance, is another key feature of the proposed architecture. 
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